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The research reported in this paper was ^designed to 
analyze the incidence of use of higher-order rules .by students 
solving geometric Construction problems. A carefully selected set of 
construction problems was subj'ected to rigorou^s a priori analysis by 
mathematicsf educators to determine what basic and second-order rules' 
might be us\d by able high school students in their solution. 
Categories of- problems analyzed include: patterns of two loci, 
patterns of similar figures, combined two loci and similar figures, 
patterns o'f auxiliary figures, and patterns of loci, sii^ilar figures, 
^nd auxiliary figures. The analysis was successful in ma^king nore 
precise the heuristic approach of George Polya. Overall, the 
viability of this method of analysis was demonstrated. The authors 
cite some limitations of the study and future directions for their 
work. (SD) t 
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According to Polya (1*962), perhaps the greatest value to be gained from the 
study of mathematics is the ability to solve problems. In spite of its importance, 
however, relatively little is known about how to teach people to solve problems, or 
how to program computers to do so. Specifically, one of the great mysteries of our 
time is why some problem solvers (human or computer) succeed on problems for which 
they have all of the necessary component skills (operators) whereas others fail. 

In dealing with this question most research. In AI has been concerned with the 
construction of powerful computer programs v)hich can solve moro or less diverse classes 
of complex problems. In computer simulation an attempt is made to also pj^rallel human 
performance on such problems. Iff general, such systems (e.g., Newell & Simon, 1972;. 
Minsky & Papert, 1972) have been comprehensive in scope; they have been cpncemed with, 
proi^lcm definition (the construction of subgoals), memory, the derivation of solution 
l^rocedures, and the use of such , procedures. 

The present research ha's^ adopted a somewhat different strategy. It seeks 
understanding by pealing separately with the various aspects of problem solving (e.g.i^MiKij 
the derivation of solution procedures). In particular, this , research is concerned wi| 
the specification and testing of general, potentially useful heuristics ^or^ construct^ 
ing procedures for solving compass and straightedge construction p^roblems in geometry? 
The research also was concerned with developing and determining the feasibility of a bbcbdit?! 
general method by which heuristics may be identified iff arbitrary problem domains. 

One general poilfit of departure was Polya' s (1962) work on heuristics for geome^^£^t 
Construction problems. These heuristics are purposely; cast in a form designed to parl^^^r^:: 
lei human thought processes in much the same way as a)re such general heuristics as me^s^^-^: 
*ends analysis (e.g., Newell & Simon, 1972). Human processing presumably is highly 
^efi^ciijent in many situations, and the importance o£ paralleling human processing in AI, 
as well it) computer simulation, has become increasingly well recognized as a means of ^ 
Significantly reducing processing time. Winston (1972), for example, has noted how 
constralQing syntactic procedures to reflect underlying semantics- in the recognition 
of block scenarios can drastically reduce the number of possibilities that nxust be 
considered. 

In spite of the broad acclaim for Polya's*work generally, however, and the in- 
trinsic support for his notion of heuristics specifically, it sometimes has been 
Jifflcult to -capitalize on these ideas as fully as might be desired. Although often 
useful, his jfieuristics frequently are little more than general hints, and leave much 
to be desired insofar as pinpointing what a human or computer must know iuTorder to 
solve specific kinds of problems. In order to lend themselves to technological treat 
ment, heiiristics must be transformed or incorporated into strictly mechanical proce- ■ 
dures that can be more^or less readily lmplemented>on computers. ilHeally, one might 
desire reductiiln of heuristics to algorithms; witness the alpha-beta "heuristic" (e.g., 
^ilsson, 1971). . ^ 

Since heuristics tend to be (problem) do"main specific^, the potential value of 
more or less general and systematic methods for specifying heuristics in arbitrary 
problem- domains seems fairly clear. Our 'approach to this problem was designed^o hre^ 
compatible with Scandura*s (1973) theory of structural learning, and i^B an extWsion. 
of a method u^ed earlier by Ehrenpreis and Scandura (1972). That portion of the theory 
with/which this research is most concerned has been shown empirically to reflect the 
behavior of .individual subjects. in particular situations where problem definition and 
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with some empirical verlfUftlon (Scndur.. 1973.; Scndur.. 1973b) to be extendable to 
situation, involving memoty. and. apparently, al.o problem definition (Scandur., 1973.. 
o. 348) and perception (C^V5). without ea.ent 1^1 change. The structure of th^ theori 
mst be enriched In these.^.ae. but without affecting its basic character (I. el, under- 
lying l)ehavlor mechanism) Thl. re.earph I. b..ed directly on one part of the Idealized 
-heory, In particular thaij part which la concerned with competence-the specification of 
rale sets whl^h account for •clas.es , of problems. In this theory, a rule set Is said to 
.account for a class of problems, roughly speaking. If for each problem in the class (1) 
ther^^^ls a solution rule, (operator) In the rule set which has- the problem in Its domain 
and whose range contains the solution to the problem or (2) there is a higher order rule 

the rule set which applies to rules In the set and generates a solution rule. In 
such a rule set., higher order rules correspond to-heurlstlcs. (For a more general and 
formal formulation, which allows for any number of levels of derivation and in which the 
rules are not in a fixed hierarchy, see Seandura. 1973a, Ch. 3; 1973b.) 

It seems unlikely, of course, that algorithmic methods can be found for devising 
nontrlvial rule sets or heuristics. Indeed, as Chomsky (1968) has argued in the case of 
.linguistics, no such method exists for dealing with observables as complex as language. ' 
Work in automatic programning. on the other ha«d,- while it is quite far at present from 
a satisfactory solution. Is proceeding as the authors-* understand It. on the assumption 
that significant progress In this direction can be made. 

. In the presenr research, the task of specifying heuristics is made simpler In at 
least two ways. First, and most Important, the type of competence theory proposed im- 
poses Important constraints on the nature of allowable rule sets, and In turn on the form 
of the heuristics '(lilgher order rules). In partlcylar. higher order rules are assumed 
to operate on component (lo^er order) rules to generate Integrated problem solution rules 
^procedures) . These rules may simply compose component rules but maylalso modify them. • 
tor example, by generalization or restriction rules (Seandura. 1973a).' ' , 

Second, restricting the level of analysis to that of flow diagrams, rather than 
computer programs, makes It natural ^o represent the constituent operations and decision 
making capabilities at whatever level seems to most adequately reflect human knowledge 
rather than at a level predetermined by some programming language. (We db not mean to^ " 
minimise the importance of devising working program.. In fact, parts of this analysis 
Iwvebeen Implemented by one of the authors.)" While no general assurance can be given 
wtth regard to any particular pethod. it would seem that a method which results in 
heuristics (and simple operators) that appear consistent with human thought would have 
a reasonable chance of having general value. 

/ » METHOD OF ANALYSIS ^ 

Our method of analysis went something as follows. First", we attempted to set some 
reasonably explicit bounds On the class of geometry construction problems to be consider- 
ed. In partirular. we considered only those problems in or like those of Chapter 1 of 
Polya (1962). . . ' 

Our next step was to classify these problems on heuristic-intuitive grounds.- Our 
aim was to place similar problems in the ..me categories, in accordance with the gerteral 
formrof their solutions. We were one step up in this reg.rd. sihce Polya had already 
done- part of the categoriaation for us, AH of his problems can be solved according to 
some vaSpiant or combination of the three generq,! heuristic, he describes: (1) the 
pattern of two loci. (2) the pattern of similar figures, and (3) the pattern of auxiliary 
figures. . ' 

After the various tasks had been classified, we made sure that the domains and 
ranges of each task were. fairly explicit. ^^Then we identified explicit procedures for 
solving each type of task. Care was taken to inaure that these procedures reflected our 



hn». ...u'^ r ^S" "^'^ ^roUBB to be experts in AI or in computer simulation as such, 
but rather in the adjacent and we think complementary domairi- of structural psychology 
pMcesllng s°"h 1 ^^"'"e^^ly broader than contemp^ary cognitive and iJ format Ion 



'intuitions as to how intelligent high school students ailht go sbout solving the pro- 
blems. In some cases it was possible at this point to subclssslfy some of the tasks. 

. The most critical step was to identify general parallels among the procedures 
developed for the sampled probl-ems within eachof the various classifications, and even 
more important to devise higher order rules ^operator combination methods) which 
realized these parallels as relatively formal, but still general,, procedure's. The 
. higher order rules so identified (together with the component lower order rules on which 
they act) provided a general basis for constructing solution rules for the sampled 
problems. . ° 

Then we attempted to refine the resulting higher order rules with regard to 
specific sampled .problems. This was done systematically; where a higher order rule " 
failed to yield an adequate solution rule for a sampled problem, appropriate modifi- 
cations in the higher order rule were made. A serious attempt also was made to insure 
that the higher order rules were compatible with human knowledge. 

t PATTERN OF TWO LOCI sf 

Our first step was to select's broad sampling of two-loci problems, and to devise 
procedures for solving each. For example, consider the problem: "Giveti a liiie and a 
"point not on the line, and a radius R, construct a circle of radii^ R which is tangent 
to the given line and which passes through the given point." This problem can be solved 
according t^o the following procedure: "Construct the locus of points at distance R from 
the given point; construct the locus of points at distance. R from the given line; con- 
struct a circle using the intersection point of the two loci as center, apd the distance 

R as radius." t i.^ ' 

This solution rule clearly involves the pattern of two loci. In this case, as 
with all of the problems in Polya's first category, the tasks may be characterized 
according to the form of their solution procedures: two loci are determined one after 
the other; the point of intersection of these loci in turn makes it possible to construct 

the goal figure. . 

Further analysis of the class of two-loci problems, however, revealed differences 
in the ways problems are solved. In man/ solution for example, like the example 

above, the two loci can be found independently, in either order. Furthermore, at no 
point in the course of applying the solution rule is it necessary to measure a distance. 
Some form of distance measurement, however, is required with other tasks. JSome of the 
sampled tasks require measurement itt order to construct the goal figure; the solution 
rule for another problem involves measurement before the second locus can be found, m 
still another task, one of the loci is actually given, or equivalently, can be thought 
' of as obtained by applying an identity rule. The goal figure in still another task is 
simply the point of intersection of the two loci. 

An Initial characterization , ' ^ 

As a first step in characterizing a two loci higher order rule, we systematically 
went through the various solution rules for the pattern of two-loci tasks and identified 
all of the different component rules that appeared in our sample problems either (I) 
in constructing one of the loci, or (2) in constructing a goal figi^re. The lower order 
rules we identified were mostly common constructions ^e.g. , perpendicular bisector, 
circle; parallel line) I Some of the lower order component rules were used to construct 
a needed locus, others^ were involifed in constructing goal figures, and some setved both 

functions. ^^^^^^ ^^^^^ Figure 1 shows schematically how the various solution 

. rules may be constructed from the component rules. 



4. See Appendix A. 

Lists- of the component rules involved in our analyses are available in the 
unabtldged report. See footnote 1. " * 
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The higher order rule In Figure 1 applies to the problem (i.e., the atiimilua situation, 
Sq) and to the goal (G) itself, as well as to the lover order component rules 

First, an arbitrary representation (S^^t ^i) analogous to the solved problem is 
•constructed. In our illustrative task, a sketch lik^ "figure 2 would serve this purpose^ 



Figure 2 




Note that constructing such a representation is not the same either as 'solving the pro* 
blem, or as constructing a solution rule for the problem. The sketch in Figure 2, for 
example, can easily be generated by first drawing an arbitrary circle, then drawing an 
arbitrary line tangent to it, and placing an arbitrary point on it. More generally, an 
arbitrary representation (Ri) of the goal figure (R5) is Constructed first. Only then 
is a^representation (Si) of the information given in the stimulus situation (Sq) 'con- 
structed in relation -to the representiation of the go^l figure. In effect, the first 
operation on the higher order' rule amounts to representing geometrically the meanings of 
goal situations (i.e., goals plus stimulus situations) by a "sketch,^* or some equivalent 
representation.^ 

, The second step ia^^the question: "Is there a point X in (Si, Ri) which satisfies 

tWo locus conditions - and^ if so, is ):here a goal constructing rule (rg) such that 
point X is contained in the domain of rg^(Dom rg) and such that the range of rg (Ran rg) 
i$ contained in the goal, G?" 

As shown in Scandura (1973a), decision making capabilities can be characterized 
as partitions on a class of input situations; in the present case, .each representation 
(S^, Ri) either contains a point X which ^atisfies two locus conditions or it does not. 
If it does satisfy two such conditions, then the- next operation involves forming the 
ruiLe consisting of (1) a decision which asks whether there is a point X in the domain 
of|rg which satisfies two locus conditions, (2) the rule rg, and (3) stop. 

Next, the available component rules are tested to see whether there are two of 
th^m which apply to thei represented stimulus (Si) and generate loci which contain the^ - 
poitit X. Given that such loCus toil es exist, the next operation constructs the solutlofi 
rul^e R3 in which first one locus rule is applied (after testing to s^e whether the 
stimulus situation is in its domain), when tlie other r^^t, and finally the goal construc- 
tion rule rg.^ 

A more rigorous analysis 

I This level of description is sufficient to give one an intuitive feeling for how 
the higher order rule operates. But the rule la^mbiguous, especially for computer im- 
^leoientation purposes. In the first decision ^king capability, for example, it is not 
cleat just what constitutes a locus condition. Similarly, in the second- decision making 



. See Appendix B. 
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capability the notion, of a rule applying to a stiimilu^ situation is something less than 
precise* . > ' 

^ Closer perusal of the individual 'tasks made it possible to overcome these ambi- 
guities. In many cases, the desired point X is a. given distance from one or two given 
points and/or lines. In the example above, for instance* the point X is a distance R 
^rom the g^iven point and from the given line. This suggested the following more rigor- 
ous charocterization of the first decision making capability: 

(1) Does there exist a" point .X in <Si, R^) and a rule rg such that (X, E) is 
contained^ in the domain of rg where J is a givgn distance, and the range of rg is 
contained in the -goal (Ran rg CG) such that X is a given distance from one or two given 
points and/or lines? ^ , 

A similar analysis suggested reformulat.ing the second decision making capability 



as: 



(2) Is there a rule rj, such that a pair consisting of given points', lines, and/or 
distances in is in the domain of r^ (Dom Vj^) and such that X is a member of L (i.e., 
a point on L) where L is contained in the range of rL (XgLgRan rL)?- 

A similar characterization is required for r^i . 

A higher order rule incorporating these refinements can be used to generate solu- 
tion rules for many two-loci problems. For example, in the illustrative problem there 
is certainly a point X in the representation (S^, R^> which is at the given distance R 
from a given point and from a given linfe in S^. It is: also ti:ue that there, is an r 
rule which applies to the pair consisting of the point X and the given distance, ani 
whose range consists of circles and is thereby contained in the goal. 

Unfortunately, as it stands, th^Jrodtfied. higher order rule does not provide an 
adequate means for characterizing solution rules for other sampled two-loci tasks. In 
certain tasks, f6r example, no distance is given. We important requirement in such ' 
cases is often that the point X Jbe equidistant from a given pair of elements, points 
and/or lines, in two different instances (i.e., for two given pairs of elements). Thus, 
in -the tasks, ^'Inscribe a circle in a given triangle," the desired point X is equidistant 
simultaneously from two different pairs of sides of the triangle, or equivalently, the 
point X is equidistant from the three sides. 

Still other tasks involve the (lower order) rule for constructing the locus of 
vertices of an angle of given measure subtending d given line segment. The task, "Given 
side a of ,a triangle, the median M^, and the measure of angle A opposite side a, construct 
the triangle," is of this type. • -The locus of vertices, in this case, ls*an arc but the 
points on it are not at a fixed distance from any point on the given segment. Nor, are 
^ th^ poitlts of the locus equidistant from any two particular points on the line segment. 

In order to take these ^possibilities into account, the decision making capability 
was generalized so that the point X coiJld be equidistant from a pair of points or lines, 
or could sdrve as a vertex of an angle of «given measure whose sides subtend (i.e.. pass 
through the end points of) a given segment. Decision making capability (3) was also • 
enriched so that pairs consisting of angle measures and/or segments could be in the do- 
main of a locu6< rule. ^ " - 

Further, in the problem, "Given three intersecting lines, not all intersecting 
at a common point, construct a circle which is tangent- to two of the lines and whose 
center is on the third," we have a situation where one of the loci, the line contain- 
ing the point X, is already given. To handle this possibflity we simply assume an 
"identity" lower order rule, one which identifies a given line as a required locus. 

With these modifications, the higher order rule handled almost all of the pattern 
of two loci tasks we had' sampled. We ran into diffic'ulty, however, with another task: 
Given two parallel lines and a point between them, construct a circle which is tangent ' 
to the two lines and passes through the point." This difficulty involved the second 
^ decision making capahillty (3). There is a pair of lines in the domain of one of the 



locus rules • one vhlch constructs the locus of points equidistant from the two given 
parallel lines. The second locus rule, however, requires that we first measure a 
distance between two parallel lines, one of which is notv present in the stimulus JS^ 
until after the first locus irule is applied* Thaft is, we need to determine the distance 
between one of the parallel lines and the locus of points equidistant from the two given 
^parallel lines. This distance serves as the desired radius* . ^ 

Appl:(.catlon of the higher order rule in this case results in failure at decision 
miaking capability (3),. Fortjinately, it is easy to^modlfy the higher order rule to take 
this possibility into accoudt. ^ Furthermore, as we shall see, this modification serves 
an Important purpose in dealing with the larger class of construction problems solvable 
either by the pat t .em of two loci or by the pattern ofc similar figures* 

Instead of stbpplng when the second decision falls, we simply add another group 
of tests (A-.G) * (A) and (B) dupUcate (1) and (2) except that X must satisfy only one 
specific condition* (C) asks: ''Is there one component rule such that a pair of given 
points and/or lines is in the domain of , that rule and Is there a locus L such that the 
point X 1^ part of L and L is contained in the x;ange of rL?" If the answer to this is 
no, we stop, but if the answer is yes, we can ask whether there is another locus^ rule 
r^ such that the represented stimulus situation S]^, together with the preceding locus 
, r^CSj), contains a pair of given points and/or lines that are in the domain or 1:^1* 
A revised higher order rule whlc^ incorporates all of these inodlflcatlons is 
shown in Figure 3, found on the following page* 

In checking this higher order rule we found it to provide an adequate account not 
only of all ofVthe pattern of two loci problems sampled, but others' as well* For ex* 
ample; consider Task A: "Given sides a, b, and c of a triangle, construct the triangle*" 
In this case, application of thie higher order rule generates the solution rule* This 
solutloti rule Involves: (1) application of the rule, "Construct the locus of points at 
a given distance from a given point," to the end point of one line segment using another 
side as distance^ followed by (2) another application of the rule to Ihe other end point 
using the remaining side as radius* Then, the triangle rule, "from a point not on a 
given segment, draw segments to th^ end^polnts of the given segment^" is applied to\^the' 
Intersection of these two loclyfb "obtain the desired goal figure* y 

In some cases, of course, different lower order (component) rules were Involved* 
For J&xample, conslder^task B, "Given two intersecting lines and a point of tangency on 
orfe of the lines, construct a circle. which is tangent to the two lines and which passes 
' thro;ugh the given point of tangency." In this case, the lociis rule for constructing 
perpendiculars to lines through points on the given lines not been required with any 
of the sampled problems* ^ - 

Discussion ' ' ■ ' ^ ^ ^ • ' -^^--^ 

' * . ' * . 

Aside from' the possibility that new two-loci problems may require additional lower 
order rul^s, the higher order rule appears adequate* In particular, the higher order 
rule not only generates solution rules for each of the sampled two-loci problems, but 
also seems compatible Hlth human knowledge* 

As" the form of the higher , order tule suggests, the component deciision making 
capabilities play a crucial role in deriving solution procedures. These decision making 
capabilities are designed to reflect the underlying semantics of the problem situations 
by referring directly to ilgural representations of semantic information implicit in 
the problem descriptions* In general, parts of a flgural representation <Si, Ki) will 
represent the meaning of a task statement and reflect the relationjbetween the given 
stimulus (Sq) and the goal figure (R^)* Notice that while the relation between and 
will be the same as between and R^, and Rj will not in general be the same as 
and Rq, respectively* 
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/ For purposes of owe. analysis, the decision makfng capabilities were viewed as 
atopic although they can also be analyzed into more basic components. The first decision 
making 'capability in the second two loci higher order rule, for example, involves both a 
conjunction and disjunction of a number of simpler conditions. This decision making 
capability could be subdivided, for instance, into the following Uo decisions: (A) Is 
there a point X that is a given distance from a given point and/or line? (B) is there 
a point X equidistant from a pair of given points or lines? Instead of having one deci- 
sion making -capability Involving conditions A and'^B, then, we could have one decision 
making capability involving A, and a subsequent one, B.^» 10 " 

'In addition to its purported .compatibility with human ktfowledge, ^he higher order 
ruie is also sufficiently precise to be mechanizable. One of the authors (Wulfeck) has 
recently written a program in SNOBOL A which uses an intermediate version of the two ' 
loci-higher order rule ("see the unabridged report) to generate solution procedtires for 
many of the problems we sampled. A naming system replaced the figural represen-tation 
described above (see footnote 7). Routines corresponding to many of the lower order 
rules (see the appendices of the unabridged report for lists of the component rules) 
Were also written. , 

Granting the adequacy of the higher order rule for purposes of pur analysis, we 
wish to comment briefly on some li-mitations in regard to the compatibility of the lower" 
order rules with human knowledge, though the ^peq^if ication of component rules is not <^ 
central concern. These limitations are gll variants on a common theme: The lower 
order rules we have identified can be constructed from more basic components. This fact 
is reflected in at least three ways. 

First, many of "the simple rules have components in ^common. Several rules, for 
example, all involve constructing a locus of points (circle) at some distaf.ce from some^ 
point. The differences lie in whether or not the distance and/or center points are 
, given directly or must be determined first. The conrtruction rules needed- to determine 
these distances and/or center points are quite. basic and dre apt to be useful in a wide 
variety of construction situations.- Any reasonable account, designed to deal with a 
. wider variety of problem 8ituation,B, would undoubtedly include these- construction rules 
directly in the rule, set. 

Second, certain of the identified lower order rules, particularly the rule for 
constructing the locus of vertices of an angle of a given measure subtending a ^iven ' 
line segment, are complex in themselves and cannot automatically be assumed to be avail- 
able to many problem solvers. 

A third limitation is closely related to the first and was" mentioned earlier: 
the lower order rules are to some degree specific to the tasks we have identified. To 
some extent" this may be unavoidable because there are always certain problems which re- 
quire "trick" solutions. It would be desirab/e, of course, to ke^ this to a minimum. ' 
In this regard, it should be emphasised that the simpler the lower order rules the greater 
the problem solving flexibility. -\ . ' . 

'One way to modify oun characterization to handle these limitations would be to 
"reduce" the lower order rules into their components and, correspondingly, to "enrich" 
the higher order rule_by adding sub-routines for constructing the needed locus, rr, 
and goal, r^, rules. Such rules would correspond to the type of knowledge that a 
person just having been -taufght the basic construction rules would need to have in order 
to generate solution rules directly. 

For example, consider the rule: "Determine the distance between a given point 
and a given line and then construct the '^ocus of points at the obtained distance from, 
the given point." This rule can be divided into two subrules: (1) "Determine .the dis- 
tance between. a point and a line," and (2)' "Construct the locus of points at a given 
distance from a given point." To compensate for the reduction in the latter case, the 
higheif order rule could be "enriched" so that more complex rL and rg rulefe can be 

— " — f 

9- For a discussion of how new decision making capabilities are learned from' 
simpler ones, see Scandura (1973a). . 
' / ' . 

10. Such .refinement may be useful in the assessment of behavior potential (durnin & 
Scandura, 1973). specifically in increasing the precision of diagnostic testing. 

^— I See Appendix E. 
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' whpn It meets certain prescribed conditions, as we have done so far, we include in the 
h&er order rule ^ simple sub-routine for combining component lower order rules. Si^ch 
a sub-routine for example, might select , (sub) rules until one is found whose domain in- 
cludes a pair consisting of a point and a line (e.g., the distance measuring ruled). . 
and another (e.g., the circle rule (2)) such that its range consists of circles (loci). 
To make the search more efficient, it is natural to add the requirement that the range 
of the former ;be contained in the domain of the latter. After the component rules have 
been identified, the sub-routine would fonn the composite of these rules, au^d finally, 
would test the composite against the condition In the initial higher order rule. 

As attractive as this possibility might appear at first, a little thought ^ug- 
gests Its Implauslbllity as a way of modeling human knowledge. This can be seen by 
noting that all geometric .constructions with straightedge and compass are generated by 

.iust three basic operations:- (a) using a straightedge (e.g., to draw a line, ray ,. or 
l^ent through two given points-, or through one point, or intersecting a line, etc.J 
:|bX%awing an arc given a compass set at some fixed radius, and (c) givei^ two points, 
siit'iivg^a compass to the distance between those points. V , , ^ , „ 

• Vn-ms we have seen, many of the lower order rules are really quite complex. Re- 
ISili^ a higher order rule, designed to reflect human knowledge, to generate such rules 
flilemental components is unrealistic. It is unlikely that a subject who, is only abL 
toip^jfform the three indicated operations above would^also have at his command a rather 
convex and sophisticated higher order rule. The acquisition of such complex capabili- 
mmMnaiw subjects, whether of a higher or lower order; would almost certainly have 
loipLabout gradually through learning, presumably by interacting with problems in 
the environment, ^2 ' - ' / 
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F - PATTERN OF SIMILAR FIGURES 

Three classes of similar figures problems 

The pattern of similar figures problems' were analyzed in similar fashion. Again, 
we began witfi a broad sampling of problems from Polya (1962). One of the problems iden- 
ii-llfSd was' -'Given a triangle: inscribe a square in it such that one side th* square 
^ ?8 contained in one side of the triangle ^nd the two other opposite vertices of the 
squar^ fie on the other two sides of the triangle." The second step was to identify a 
so?uUon\ule for each of the problems. For the problem above the solution rule was 
"Construct a square of arbitrary size such ^hat one side is ^^^^^J^^J^J" ^^^^^J^/f 
' triangle which is to contain the side of the goal square, and such that one ^ej^ex is or 
/-anoZr side of the triangle. Draw a line through the point of intersection of those 
: ?wo s!des'of the tria;.gle%nd through the fourth vertex ^^/^e arbitrary square Jrom 
, the intersection of this line and the third side of the ^^^9"fi%if^^5/^f^'J^/°^i3ngl. 
vertex o£ the goal square) construct a segment perpendicular to the side ^^e triangl. 
wMch ifto contain. a side of the goal square. Complete the goal square using the 
length of the perpendicular segment as the length of the sides. 

Similar figur-es problems, like the example task above, may be characterized, as 
' those whi:i solution prLedufes Involve a similarity -PPin^ process, from som. c nter 
t^r^i^r'i nf «s<mllaritv a" figure or set of points "is mapped onto another, rurtner, tne 
^?itiin r^ Sire al^ays'^n^olve constructions according to geometric invariants unde 
similarity mappings, i^-ther parallel lines, since parallelism is preserved, or equiva- 
lents, "copying" angles, since similarity maps are conformal. 

Further analysis ^f the similar ^figures„ problems revealed three "J^^^^^^y ^i^" 
tmct classes of solution- rules. In the sample problems above and in ^^her problems 1 
Jhe same class, the solution rules all involve first const-ructing a square of arbitrary 
slL which is in the same orientation as. the desired goal square, /and which meets as 
many of the tasS conditions as possible. (Rules of this type fop/constructing similar 
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.tigures are denoted by r The second step in each solution rule 'uses two pairs qf ' 

corresponding points in Che goal and similar figures (i.e., in {S^, Rj) superlniposed 
with the, similar figure) to determine the point of similarity (Pg), and then, constructs 
a line through the point of similarity and a point on the similar figure which qorres- 
ponds to a needed point of the goal figure, (Point of similarity rule? are d^oted 
rpgj Finally, the obtained point oh the goal figure is used as a basis for constructing' ' 
trie goal square. ' , * ^ 

The second class is well represented Uy the problem, "Given angles B and C bf a 
triangle, and thevwedian Mg to side a, construct the triangle." The corresponding 
solution rules begin similarly by applying a^similar figures rule (rg) to two given 
angles to construct an* arbitrary sized triangle similar to thje goal -triangle, witli 
medians,' altitudes, etc., as required.^ Then a modified point of similarity rule (rpe) 
is u^ed to determine the point of simiVarity (Pg, the vertex of the non-given angle), 
and to construct the given Segment (e.g., Mg) , sucl\ that one endpoint of the segment is 

^the poiht of similarity, and such ^hat the segment coincides with the cotresponding seg- 
ment .ifi the similar tnapgle. Finally, a line is constructed,^ through the other end- 
point of the constructed' segmeot'parallel to the side of the similar triangle that £s 
opposite to the point of similarity. The remaining sides of the goal triangle are ob- 
tained by extending two sidesofjthe similar triangle to intersect the constructed 
parallel line. ^^^T^^^ 

The solution rules' for the third class of problems differ in that the first step 
in each'is to Use an r^ rule to construct a locus^of points which contains a critical' 
point, specifically the center of the goal circle. In the problem, "Given a line and 
two points (A and B) on the same side of the line, construct a circle tangent to the 
line which passes through thje two g^ven points, " for example, the locus of |)oints (L) 
Equidistant from the^ two given points contains the center of the, goal cirple^^* Alsa^ 
the -point of similarity is the intersection of the locus and the given line.' The Sec- 
ond step is to construct a similar figure (circle, C^^), thich satisfies part of" the 
goal condition. In our example, a circle is constructed with, center on the constructed 
locus and tangent to the given line. Next, another vers^on^of the po'lnt of similarity 
rule is applied; this time the point; of similarity (Pg) and a given point on the goal / ' 
figure (e.g., B) are used to determine a corresponding poinf (B*) on the simflaF circle. 
Then., parallel lines involving corre&iponding points are coniitructed to determine the 
center of the goal circle. Finally, the goal circle is actually constructed. 

The similar figures rule 1 * ' 

* ^ - j; '^ ' . *^ 

The higher order rule shown in Figure 4 (together with a set of applifatile lower 
order rules) provides a sufficient basis for solving aU of^the j^^^ 

lar^fi^r^es^f^ohlems. FurtheOT«"e , The higher order rule appears to reflect the underiyit^g 
semantics (Fig^rp 4 foutid on the following page). For example, let us see how a solution ^ 
rule for the first illustrative problem above (inscribing a square in a triangle) can 
be generated by application of the higher order rule. . Tfie tirst decision making ^ 
capability (A) asks essentially whether a point X is iieeded to setVe as the center for* 
a goal circle. As the goal figure is a square, the answer is obviously no. Decision- 
making capability J then asks if. there is a goal similar figure rule (r ) which applies 
to, representing stimulus and generates squares that >satisfy "part (GgT of the goal 
condition (i.e., the range of rgg is contained in Gg which In turn contains G - equiva- 
lently, anything which satisfies G, satisfies Gg, but not necessarily conversely) . The 
lower order rule, "Construct a, square in a triangle with one side coincident with one 
aide of the triangle and one vertex on another aidedf the tiriangle," satisfies thesk^ 
conditions do the rule is retrained as indicated in operatioA K. - 
Decision making capability L asks two things: (1) Is there a point Xg which 
corresponds to a missing point X in the goal squai^e? (2) Is there a rule r^ such that 
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Sftinulus S^, supplemented with the point X (l^e., XU^x), Is In the domarln. of Tg, 
and r'geaerates a gool-like figure (tan ^g^^)? 'In short, is there a point Xg in 
« the similar square which corresponds to a point ^ from which the goal i^quare may be 
'constructed? Clearly, there- is such a point Xg and the rule, ''Determine the distance 
from a point to a given line segment and construct ^a square with sides of that length" 
satisfies* the necessary conditions. Operation M forms the solution rule consisting of 
'the* two )rulcs above with the point of similarity^rule between ^thcm. j 

To see how the- higher order rule works wiqh the second dlass of problems, con- 
sider the* second illustrative problem abpve (constructing a triangle, given two angles 
and a median). Ih this case, the lanswers to decision making capabilitie'^ A and J are' 
again "no" and "yes," respectively. Heije., rgg is, "Construct a tri-angle of arbitrary 
»size using two given angles and add parts correspondJ^ig, to given segments." At deci- 
si6n making capability L there is a point X in the goal* figure,^ the endpoint of median 
which can be specified by rpg. Operation M again forms the solution rule. 
Notice that the first two classes of problems involve the same path in the 



higher order rule. Each solution rule requires a goal similar figure rule (rgg), the 'pbint 
of similarity rule (r{>g), and^a goal constructing rule ^rg) . The only difference is" 
whether the goal and similar figures are squares or triangles, ^ith all that^ impJiesHhor 
the particular rg^ and rg rules required. In short, this example illustrates how what 
ntay appear initially to^oe basically different kinds of problems may turn out to have a 
common genesis. ^ 

The third problem (constructing a circle tangent to a given line and passing 
through two given points) illustrates the other path through the higher order rule. In 
tlifs case, if we knew the center (X) of the desired circle^ we could solv^ the task. 
Furthermore, this missing point X is on a locus, namely the locus of points' equidistant 
from the two given points. Hence, decision making capabilities A and C are satisfied, 
and we retain the circle constructing rule (rg) and the peHrpendicular bisector rule. 
Decision making capability F asks if there is a rule (^gg) which applies to the stimulus 

as modified by the output of thexocus rdle (i.e., Sj^ U rpg (Sj^)), Condition F As 
satisfied Jby a rule that generates circles with cehters on a given line (the locus) and 
tangent to another given line. The answer to the decision making capability i«s also 
"yes." The two given points on the goal figure obviously correspond to two points on 
the similar circle* By operation I, the solution rule foll£>ws directly: "^Construct 
the locus of points equidistant from the two given pointsf^on struct a circle with center 
on that locus tangent to the given line; apply the point of similarity rule, and then 
the parallel line rule to determiti|j'^ the center of the go£ll circle; construct the goal 
circle using this center and the distance between it and a given point of radius.*' 

It should be noted that in one of the sampled tasks the "locus'* is given. The 
easiest way tq handle this special case is to fimply add an identity locus constructing 
rule as before. It would also be a simple .matter to modify Vhe higher order rule to 
take this possibility into account by asking, prior to or at decision making capability 
C^ whether there is a line in which contains X. 

v: ^ 

Combined .rule for two^loci and similar figures problems 

, It would appear from our analysis that the two higher order rules, together with 
the necessary lower order rules, would pirovide an adequate basis for solving the 
sampled two loci and similar figures problems and others like*them. Indeed, there are 
*two possible modes of s/6lution in the case of one of the sample^d similar figures tasks: 
"Inscribe a square in a right triangle so that two sides of the square lie on legs of 
the triangle, and one vt^rtex' of the square lies on the hypotenuse." Instead of using 
the pattern of similar figures, «• illustrated in our first example, the pattern of two 
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iocl rule can be \ised to construct the bisector ot the right angles The Intersection 
of this iocus with the hypotenuse (the other locust is the "missing point" X and pro- 
vides a sufficient basis for constlructing the goal square. 

Although it is not always critical tb distinguish between different mode^ of 
problem solving, any > complete account designed to reflect human behavior must specify 
why one mode of solution is to be preferred over another .(cf •^.Scandura, 1973a, Ch. 8). 
•In the present case, there are two possible ways of handl4ng this. First, we can add a 
'higher order selection rule to the rule set which sAys simply. If both higher 'order 
rules apply, select the pattern of two locl.^ The rationale is that the pattern of two 
loci rule will generally yield a simpler method of solution. 

A second' way to handle the problem is to devisfe a single higher order rule which 
combines the advantages of both higher order mles. The Jilgher order *iailes in Figures 
3 and 4 can be combined to yield the higher order rule deleted in Figure 5. The. path 
in this higher order rule designated <1,2,3,4) corresponds to tjiat p4th of, the two 
, , ^ - ;/ I I ."^ ' loci higher order rule which deals with those 

cases where the two loci may b§ found in either order. The path (1,2,3,A,B,C,D,E,4) 
deals with those two-loci p^blem^ where one locus^must be found before the other. The 
other two paths correspond t^the similar figures higher order *rille. ^ 

PATTERN OF AUXILIARY FIGURES 

* Not all compass and straightedge problems can be solved via the pattern of two 
loci or the pattern of similar figures. In this section, we describe a higher order 
rule for dealing with the third class o^ problems identifjLed by Polya (1962)', the pat- 
tern of auxiliary figures. We al*8o show how the combined higher order rule (above) may 
be extended to account for essentially all of the construction problems identified by 
Polya (1962). . ' • * 

Auxiliary figures higher order rule ' 

Our initial analysis was based on a sample of five diverse auxiliary figures 
problems. One of the problems used was, "Given the three medians of a triangle, con- 
struct the triangle." < 

The analysis proceeded as before. First, we identified a procedure for solving 
each problem. Then, we looked for similarities among the solution rules and identified 
the component rules involved. In general, the required goal figures were not construct- 
able via either the two loci or similar figures higher order rules. However, in each 
case the goal figure could be obtained from an (auxiliary) figure that was constructabjle 
from the given information. In the problem above, for example, a triangle can be con-J 
structed from segments one-third the lengths of the given medians. The goal figure is 
obtained by extending two of the sides of this auxiliary triangle to the respective 
median lengths and drawing lines through the resulting endpoints. s 

The analysis resulted in the auxiliary figures higher order rule shown in Figure 
6. This higher order rule generates a solution rule 

for the illustrative task above as follows, • First, an arbitrary representation for 
the solved problem < Sj^, Rj^) is constructed. In this case, an arbitrary triangle is 
"sketched," and its medians are represented on it. The first decision asks whether there- 
is (1) btC auxiliary figure, and (2) a rule rg which operates on the auxiliary figure« 
and generates the goal figure. In this task, there is such an auxiliary figi^e, a tri- 
angle having sides one-third the lengths of the given medians. In addition, the rule, 
"Extend the constructed segments to their given lengths and draw lines through their 
endpoints," satisfies condition (2). The next decision (III) asks whether or not a 
point is needed, in addition to the auxiliary figure, to construct the goal. Here, the 
answer is "no"; no other point is needed. Finally, decision IV asks If there is an 
auxiliary figure construction rule (r^) available whose domain contains (Sj^g Dom r^) 
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Construct representative <S^, pair. 
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and v?hMe range contains^ the auxiliary figure ii»e*, Ran r^u ^ a|a is iik6 auaj ). 
In this case, thfe rule, '•Coiistruct' a triangle from segments one-third the lengths of 
three given segments (tfledians)^' Satisfies these conditions and operant iort V constructs, 
the solution rule, '^Construct a triangle having sides one-third the length of the given 
medians; extend two segments of the constructed triangle to the respective median lengths 
and draw lines through the endpointe o£ the medians to construct the goal triangle. ' 

The other path through the, higher order rule may be illustrated using the task, 
"Given the f6ur, sides a,b,c,d of a trapezoid (a <c) , .construct the trapezoid/* Again, 
the answer to decision X is "yes." (Where the answer is "no," the higher otder rule 
fails.) The triangle with c-a, b, d/as ^des, serves as the Auxiliary goal figure and 
the goal rule, "Through comer points of)an auxiliary figure and through another point 
not in the auxiliary figure, draw segments to complete the goal," is selected. Unlike 
the first path, however, the answer to decision lll is "yes" since the ^^goal rule (rg) 
acts on pairs (X U AUX) consisting of auxiliary figure ^nd a critical point X. The 
next decision (IV) asks if there is a Vule r^ that constructs the auxiUaty figyre from 
given information. This condition is^satisfied by the r^ rule whfcTi constructs the auxi- 
liary triangle from the, sides of- a trapezoid. Decision VIII asks whether there are two 
locus rules (r^ and r^^i) which apply to the auxiliary figure and/or other„ given, informa- 
tion (Sj) and whol&e ranges contain X. The circle rule (r^) , applied to different portion 
of Sj^U AUX, plays the role of both locus rule^. The solution rule (Operation IX) is a 
concatenation of the component rules. 

Combined two-loci, similar and auxiliary figures higher order rule , 



Taken collectively, the three higher order rules described above can be used to 
instruct solution procedures for a wide range of geometry construction problems. 
Furthermore, they appear compatible both' with human behavior and with the heuristics 
originally- identified by Polya (1962). , , 

This is not meant to imply however, that the three higher order rules are un- 
related to one another. Both the needed point X in tfl^e pattern of two loci, and the 
similar figure in the pattern of similar figures can be regarded as special auxiliary 
figures. Indeed, one c5uld modify the ^duxiliary figure higher order rule so that ^t, 
together with the relevant lower order rules, would account for all three classes of 
problems. In addition, the similar l^nd auxiliary figures higher order rules may be 
vieweoh as progressive generalizations of the tw\)-loci .higher order rule. It is not 
difficult to conceive of third level higher order generalization rules which have the 
two loci higher or<ler rule and a similar or auxiliary figure as 'inputa, and a more gener- 
al higher order rule ia which a similar or auxiliary figure ts substituted for the miss- 
ing foint X, as the csHJ^esponding q^tput. 

Alternatively, the 4:ombined two-loci, similar figures higher order rule (Figure 
4) can be extended to include auxiliary figures. In addition, the extended higher order 
rule depicted lA Figure 7 allows recursion on the higher order rules. 

To see this, notice that the higher order rule shown in Figure 6 
can terminate at several points without finding a solution rule. In some problems this 
is unavoidable; there may not be an auxiliary figure from wh^ch the goal figure can be 
constructed. Sometimes, however, there- is an auxiliary figure, but one which is not 
directly constructable from the given information. Such auxiliary figures can often be 
constructed via the pattern of two-loci, the pattern of similar figures, or the pattern 
of auxiliary figures itself/ In those cases whfere such an auxiliary figure exists, we 
allow for this possibility by returning control to the start of the combined higher / 
order rule in order to derive an rg rule for constructing the auxiliary figure. Onfce an 
auxiliary figure (rg) rule has been? derived, the original procedure resumes. 

To see how tehiiiiigher order rule works, consider th& following task, "Construct 
a trapezoid given the shorter base a, the base anglea A and D, andvthe altitude H^." As 
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in the trapezoid example- given earlier, the needed auxiliary figure is the trianale 
having sides c-a, b, pnd.d. But, this triangle is not directly constructable from the 
given information. Hcfne of the assumed lower orderlies is adequate, so the hteher 
order Vule breaks down at step VI. The flow o/ control therefore returns to step 1 



Zi — r r ""7" "^^v,^'" *ne now oi control therefore returns to step 1 with 

tl iiVl '^^rf'^^J^S auxiliary figure.l^ Beginning here, the problem of construct 

. ing this, auxiliary figure is a stwightforward similar figures task, one in fact which 
we nad sampled* v ' 

The higher order rule of Figur«e 7 also generates solution'~>ules/for even more " 
complex problems, provi^d we assume the necessary component rules. For example, con- 
''i?*!! w P"^^*"** Giyen three noncollinear points A, B, and C, construct a line XY 
which ^tersects se|nrentp^ in the point X and segment in the point Y, such that seg- 
ments AX, XY, and YB are all of the same length." *^ ' 



Figure 8 



; (Pc) 




U4 may wish to derive the solution rule for this more difficult proble^ 

himself. (Hint: Several recursions are required. jPor details see" the unabridged re- 
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Summary 
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In. summary, a quasi-systematic method for characterizing heuristics involved in 
problem solving was proposed and illustrated with compass and straightedge constructions 
in geometry. Higher order-rules, together with corresponding sets of lower order rules, ♦ 
were constructed for the two-loci, similajr figures and auxiliary figures problems iden-' 
tified by Polya (1962) ^ First , the two-loci heuristic of Polya was made precise. We 
saw how' decision making capabilities (decisions), arid particularly^ the conditions used 
to define decisions, play a central role ih higher order rules. Thfe similar figures and 
auxiliary figures heuristics Were similarly formulated. We also showed how the tWo-loci 
and similar figures higher order rules could be combined to form one higher order rule, 
which (together with appropriatdf lower order rules) provides ^ basis for solving both 
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kinds of problems. Finally, a combined two-loci, similar and/or auxiliary figures high- 
'er order rure"^wa8 constructed. This higher order rule allows recursive retumr to com- 
ponents of the higher order rule, corresponding to the individual higher order rutes, 
and was considerably* more powerful than the others. Its use on some compUx problems 

was illustrated. , . , -t. tk« 

Overall, the analyses demonstrated the viability of the analytic method. The , 
higher order rules identified were precise, compatible with the heuristics identified 
.by Polya^ and intuitively seemed to reflect the kinds of relevant knowledge that success- 
ful problem solvers might have. . u j j m. 

The ceiitral role played by Semantics in the analysis should be emphasized, me 
meaning of eacii t^ask was represented by a goal figure <Si, Ri> representing the giverf' 
goal situati<^ (Sq, Rq) • The -relations among, and properties of, the elements of these 
figures, together with the domains and ranges of |lndividual rules, were reflected direct- 
ly in thfi. higher order rulfes. Although little attention was given to. the formal repre- 
sentation of semantic features, the goal figures clearly placed powerful constraints on 
the rules selected at each stage in applying the higher order rules. Representation in 
terms of ^ome arbitrary (e.g., random) syntax, unconstrained by goal figures, would have 
necessitated backup capabilities and, in pftnciple, could easily increase the number^ of 
possibl^^nstriiction rules at each stage beyond any reasonable computational capability. 
That is5 without the coJistraints Imposed by the 'goal figures, the number of possible 
pornts, arcs, and lines that might be constructed could be almost unliflited. The effect 
of using goal figures is very much the same as that referred to by Winston (1972) in a 
recent paper on vision. He argued that although the number of combinatorially passible 
arrangements of vertex types (Guzman, 1968) is very large, the number of types that 
yield real figures is much smaller. ^ 

Limitations " . „ ^ . . 

Nonetheless, the present study has certain limitations which, 'In principle^ could 
be overcome. ^ First, as in existing state space formulations, all of the higher order •. 
operations yere limited to compositions of rules. future research, more attention^ 
should be given to other kinds ofi operations. Generalization, restriction, and selection 
rules (e.g., Scandura, 1973a), for example, might well be expected to play an important 

role in problem solving. , v t j 

There are a variety of ways in which such rules might enter, (a) In discussing 
the two-loci higher order rule, we have already seen how the scope of a decision (making 
capability) may be generalized to generate solution rules for a broader range of P^blema. 
In particular, we saw how the first decision, which was initially restricted to situations 
where the desired point X was a given distance from two given points, could be general- 
ized, for example, to allow the point to be the same distance from two given points. It 
is not hard to envisage a generalization rule by which such shifts might be made. The 
relationships observed previously between the missing points, X,- and the f^^^^]" t"! u"^' 
lliary figures, suggest another kind of generalization involving the identified higher 

orde^yrules^^^^ aVe a wide variety^of construction' proMemS-Jwhich might require the 
independent derivation of more than one missing point X, similar figure, or auxiliary 
figure. As a simple example, consider the task of cotistructing two circles, one of which 
is to be inscribed in a given triangle and the other, to pass through its vertices (i.e., 
to circumscribe the triangle). In this case, the problem can be solved by applying the 
two-loci higher order rule twic'e. The higher order derivation rule here can be thought 
of as a generalization of the two-loci rule in which two or more applications .(I.e., - 
recursions) may be allowed. One can easily conceive of a simple higher order generaliza- 
tion rule which operates on rules and generates corresponding rules which are recursive. 
The combined two-loci, similar and auxiliary figures higher order rule is one possible 
consequence of apply sojne such higher order rule. 
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(c) If we had allowed unsolvable yariarits of the problems considered, truly via- 
ble solution -rules would hiive to be app,:opri^tely restricted. The solution rule for 

^ constructing a triangle with §ides of predetermined length," for example, works only 
■ when the sum of each pair of sides of the triangl^ is greater than the third. A com- 
pletely adequate solution rule would have <to teSf this possibility. It is possible to 
. conceive of higher order .rules, which operate on rules bf various kinds together with 
special restrictions (e.g., the triangle inequality) to generate correspondingly re- 
stricted rules. ^ 

(d) ft is also possible to conceive of thr^e aimensional analogues of ^compass 
and straightedge constructions. In this c^se, the higher order rules wduld operate on 
the usual two dimensional construction rules and would generate their three dimensional/ 
analogues. For examplq, a rule for constructing the locus of points equidistant from' 
given line (i.e., a pair of lii'.es) cbrresponds to a three dimensional' rule which con- 
structs a cylinder about the line.^^ 

A secbnd limitation is that nowhere did deduction "play a r^e in our analysis. 
In solving constructions, real people frequently attempt to justify logiSally the various 
constructions they make. Constructing a triangle given its three medians, for example, 
requires that a person know or deduce the fact that the medians intersect at a point 
two-thirds of the way from each vertex to the oppo&ite midpoint " (see 'footnote 13) 'To 

„ rtiis extent, «our analysis is littitpd and may not adequately .'reflect human knowledge. 
Our rult^s reflect semantics, but nbt inference. Extension of the proposed analysis to 
deduction should be a first order of business. It is likely that existing geometry 
theorem proving systems (e.g., Gelemter, 1959) may be useful in this regard. 

A third major limitation of this research is that cumulative effects of learning 
were not considered: each problem in our analyses was- considered as de novo . If one 
wishes to characterize solutions to problems in a given class (e.g., the two-loci tasks) 
relative to a fixed, self-sufficient set of rules, sonte fairly complex j^les (e.g., the 
angle vertices rule) must be included. Furth^rjnoVe, and in many ways more important, 
5,uch characterizations, at any particular level of analysis in"a task' domain tend to lack 
flexibility. The atomic elements are so large, relatively speaking, that there are many 
intermediate level problems that cannot readily be solved using such rule sets exclusive- 
ly. Also important from the standpoint of behavioral analysis, it is doubtful that such 
lower order rules would adequately reflect the knowledge had by most subjects, assumed to ^ 
know the identified higher order rules. Such subjects .would almost\certainly also know ' 
a^wide. variety of simpler construction rules, even though we might not explicitly include 
them m a rule set determined by sampling complex problems of the sort we used. Future 

<^*work IS planned which is designed to meet many of these ofijections. / 

Future directions ' 

The method of analysis used in the present research is based on Scandura's ^1973a) 
theory of structural learning, more particularly on those aspects of it which deal with 
competence. The aim of the latter is to specify (hopefully mechanizable) procedures 
—^which characterize the knowledge underlying given classes of behaviors (e.g., problem 

solutions) that one might wish to attribute to an idealized knower. As noted, our 
' approach to this problem involve* the. invention of finite sets of rules (including high- 
er order rules which may operate on other rules as well as on data elements) which can 
be applied as indicated for example, to generate problem solutions. 

This level of theory, of course, applies only at an analytic level in the sense 
, generative grammars account for language behavior. The relevance of the theory to actual 
human behavior, or, for that matter, to the design of artificial intelligence systems, 
depends fundamentally on our ability to specify mechanisms by which such rules ar4 to 
interact in specific situations, and what effect iflany such interaction has on the 
nature of the rule set itself. 



15. -Implicit in the above examples is another limitation 5,0 which we have indirectly 
referred previously. Our original analyses were limited almost exclusively to single 
higher brder rules. In no case did we attempt to identify rules which may operate on 
higher order rules, althtough our examples make it clear that we could have done so. The 
problems Involved In accomplishing this would be ptactlcal rather than theoretical. 

— ^ „.„ — , 



the structural aeamlng theory (Scandura» 1973a) Is partly concerned vlth the 
specification of such mechanisms The theory rests on the fundamehtal and widely hleld 
assumption that In problem, solving people are^ attempting to achieve some goal. In the 
simplified version of the theory considered here, the basis mechanism vhlch governs the 
use of available rules is as folloiJs: (K)' The subject tests his available rules (r) to 
see yif one (or more) of them satilsfles the given goal situation (i.e.. If € Dom r and 
Ran r CGoal). If so, the subject will apply it. (B) If a subjedt does not have a rule 
available for achieving a given goal, then control automatical Iv shifts to the higher 
level goal of deriving a procedure which will satisfy the original goal. (C) If a high- 
et level goat has been satisfied (that Is, if some new. rule has been derived which con- - 
tains the stlmi^lus situation In its domain and whose outputs satisfy the original goal 
criterion), the derived rule is added to the set of available ruleg and control reverts 
back to the previous goal. The third hypothesis allows control to ret<im to lower level 
goals once a higher level goal has been satisfied. (For more general and rigorously 
formulated sets of hypotheses see Scandura, 1973a.) 

PuttlAg all this together, we see that if an appropriate higher order rule is 
available when control shifts tP a higher level goal, then the higher order rule Will 
be applied and contro/l will automatically ffevert to the original goal. The subject will 
then apply the newly Iderived rule and solve the problem. If the subject does not haye v 
a higher order rt^le available for deriving a procedure that works, then control is pre- 
sumed to move to still higher levels (e.g., deriving a rule for deriving a rule that 
works). Although this process is assumed to go dn indefinitely in the Idealized thejory, 
memory places strlctf limits in actual applicatllMs. 

Even this simple assumption provides an adequate basis for generating predlctilons 
in a^wl*de variety o^ problem solving situations.' Consider the problem of converting a 
given number of yards ifito inches. There are two possible ways in which a subject might 
solve the problem. The, flrsti^is to simply know, and have available, a rule for convert- 
ing yards directly into inches: "Multiply the number of yards by 36." In this case, 
the subject need only apply the rule according to hypothesis (A). 'The other way is more 
interesting, and involves the entire mechanism as described above. Here, we assume that 
the subject has mastered'one rule for converting yards into feet , and another for con- 
verting feet Into Inches . The subject is also assumed to have mastered a higher order 
composition rule . 

In the second situation the subject does not have an applicable rule which is 
immediately available, and, hence, according to hypothesis (B), he automatically adopts 
the higher level goal of deriving such a procedure. Then, according to the simple per-y 
formance hypothesis (A>, the subject applies the higher order compositlaii "rule to the 
rules for converting yards *lnto feet and feet into Inches. This yields a new composite 
rule for converting yards into Inches. Next, control reverts to the original goal by 
hypothesis (C) ^nd, finally, the subject applies the newly derived composite rule by 
hypothesis (A) to generate the desired response. 

Moreover, this mechanism provides a basis for an efficient characterization of 
learning, since, according to hypothesis C, newly derived rules are added to the know- 
ledge base (rule sgt). Such (additional) rules are in no way dls^ingvilshed from any 
others in the rule set; for example, they may serve as component rules in new higher 
order rule applications. (Also, it should be noted that derived rules may themselves 
be of higher order and may, thus, be used to satisfy future higher level goals.) 

To see how knowledge may cummulate according to this mechanism, let us assume 
that the learner Initially knows rules for converting miles into yards ^ yards to feet . 
feet to inches , and the higher order composition rule above. Suppose also that the 
learner is first presented with the problem of converting miles to inches. In this 
situation, the learner will fall to solve the problem, since the composition rule we 
specified above applies only to pairs of rules, (We assume that it does not apply to 
itself.) However, if the problem of converting yards to Inches is presented first, the 



subject wlllydolve It as before, ^nd derive a yards to Inches rule in the process, 
^ Further, If the mllcs-to-lnches problem is then ^pre sent ed, it can be solved using the 
derived yards to inches ruie and tlie miles to yards rule as components. Although this 
example Is obviously Very simple, i£ does illustrate the potential importance of pro- 
S blem sequence in a growing (learning) system. 

Although other investigators have made use of similar notions in varying degrees, 
the type of mechanism proposed appears to^niake more general use of rule and higher order 
riile constructs. Frequently, for example, procedures, whj^ are cyllowed to operate on 
procedures are not themselves part of the knowledge base;^ they arie viewed as control 
processes. (In the present case, only the learning mechanism itself acts as a control » ^ 
process.) Noj are newly derived solution procedures often added to the set of available 
procedures. Newdll & Simon (1972, p. 135), for example, allow thr Logic Theorist to add 
proved theorems to an initial set of axioms, but .this is essentially at the level of da- 
ta^ upon <yhich proof generation procedures operate, and not at the level of the proce- 
dures themselves. Viewing learning as "debugging", (e.g., Minsky & Papert, 1972) or as 
"means-ends" analysis (Newell & Simon, 1972) is essentially analogous to the introduction " 
of higher order rules except that in these cases implicit restrictions are imposed on 
the allowable higher order rules. 

In any case, most investigations in ar'tificial intelligence have involved some 
kind of state space representation (e.g.^ Niisson, '1971) , with problem solving involv- 
ing some fype of search. No generally agreed ujion way of representing learning seans 
to have emerged, however. Sometimes, learning is treated as the modification of para- 
meters in evaluation functions which select •promising' nodes for expansion (e.g., / 
Samuel, 1959). In other cases, learning systems have been devised to reflect stimulus- 
response principles in psychology ^(e.g., Feigenbaum, 1961, Bower, 1972). Where con- 
sidered by information processing psychologists who have adopted this point' of view (e. 
g., Rumelhart, Lindsay, & Normaji, 1972), learning involves the transformation of one 
state space to another (Scandura, 1973b). 

Though the proposed representation may be formally equivalent, it is our belief, 
based OQ a variety of studies with human subjects (e.g., Scandura, 1973a), that it 1^ 
not psychologically equivalent. For one thing*, our search for basic psychological 
mechanisms (e.g., of learning) , which reflect commonalities in hximan behavior, differs 
in important ways from that in computer simulation, where the essential goal Is to paral- 
lel overt human behavior in complex instances of problem solving and where the' basic 
mechanisms (e.g., means->ends analysis), therefore, are often judged on more imnedlately 
pragmatic grounds. ^ . ^ 

Irrespective of one's opinion on the issue, the laws which govern the injteractrlons 
among individual rules are assumed to be fixed once and for all and have potentially impor- 
taant Implications for computer implementation. In particular, the fixed'mode qf interactioit 
would make it possible in principle to modify and/or extend an artificial intelligence 
system rule by rule, "without having to worry about the effects of these changes on other 
parts. (This latter property appears to some extent to be shared by Newell and Simon's 
(1972) production systems.) 

One of the major complications in current artificial intelligence research is that 
even minor changes in one part of a' system may have unpredictable effects which may re- 
quire compensating changes elsewhere. The switch to heterarchical systems (e.g., Minsky 
and* Papert, 1972) in which control may shift among individual programs in some predeter- 
mined manner, does not appear to alleviate this problem/* In contrast to the above me-t 
chanlsm, the mode of control in heterarchical systems may vary from system to system, 
and worse, from the standpoint of debugging, may interact with the individual programs 
themselves. In short, the important point for artificial intelligence research is the^ 
possible advantage for implementation of a fixed mode of interaction. ^ 

Whether or not the mode of interaction is restricted to that proposed here is 
not the most crucial point*. To the extent that artificial intelligence research may 
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peneric oy calling accounc ot isuch mechanismd^ psychological research aimed at discover- 
ing what these mechanisms are would "appear td be a first order of business for those 
interested in human thought. (For a "richer'^^theoretical mechanism which incorporates 
memory, see Scandura, 1973a, Ch. 10.) 

^ . With the foregoing in mind, an altetnatlf^ve which we are now pursuing is to begin 
initially with rule sets composed of simptier rules, and to allow these rule sets to grow 
gradually by interacting with a problem environment.'^' In the present case, only three 
atomic operators (lower order' rules) will be introduced initially: (a) setting a com- 
pass to a given radius, (b) drawing a straight line (segment), and (c) using a set com-" 
pass to make a circle. It is not immediately clear what the hi?gher order rules should 
be but, presumably, any reasonably satisfactory rule set would include some types of 
simple composition, conjunction, and generalization higher order rules, together, possi- 
bly, with variants of the two loci and other higher order rules identified above. It 
should b6 emphasized in this rega»] thajt the initial selection of ladles would not^in . 
itself be sufficient; the choice ^rd Sequencing of_ tO'-be-solved probt^M may also be 
expected to have important effects dns both ^the rate and type of knowledgtl acquisitiotU 
For obvious reasons, computer Implementation seems almost essential in thisi!. research and ' 
^is'the course we are pursuing. • . T 

^ IMPLICATIONS 

Aftiftcial Intelligence [ ' 

Th¥ present research appears to have three getieral implications for work in simu- 
lation and artificial intelligence. % 

First, the rules we haye identified may be implemented relatively easily (some have 
already been). As such, they would be useful either directly in systems concerned with / 
geometric figures and constriictlons, or Indirectly in research having more encompassing 
aims as described above. ' . ^ 

^ Second, the results aire suggestive of how the construction, of at least certain/ 
artificial intelligence systems might be partially systematized . In this regard, thi 
topic of compass and straightedge constructions is not nearly as important as is the 
fact that the analysis serves as a prototype for the proposed method of analysis. At 
the present time this method is being used to ana^lyze the proofs contained in an experi- 
mental algebra I high school text based on axiomatics. 

Third, our use of flow^ diagramming as a mode of representation of individual rules 
suggests that perhaps such representation might play a somewhat larger role in the expo- 
sition of future artificial intelligence research. The routine use of a large number of 
different and highly technical programming languages is often enough to turn away out- 
siders (such as l>urselve&) t^who might otherwise be interested. The limitations of flow 
diagrams with regard to memory considerations may be a small price to pay for a more 
neutral and faimiliar form of representation. Furthermore, flow diagrams have a flexibil- 
ity as to level of representation which is not shared by particular programming languages. 
This makes it possible to. more readily represent basic components at a level of atomicity 
tailored to immediate needs, and to psychological reality (cf. Scandura, 1973a), rather 
than to basic components determined by some programming language. These comments, of 
course, apply only to psychological and expository considerations and say nothing of the 
more strictly technical problems of representation which must be dealt with in computer 
implementations. 

Education 

The resulcs of this study also have both long range and immediate implications 
for education. The promising nature of the results attests to the practicability of the 
proposed approach as a means of identifying the knowledge underlying reasonably complex 
kinds of problem solving. addition to serving aa a prototype, the identified rules 
themselveai could be helpful in teaching high school students how to solve compass and 
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straightedge construction problems, 

dual rules.. including the hieher ordnr n^!« n °5/"'*Jy""al subjects on the indlvi- 

dual subjects in thif way and comofrlna ^^; °P"f «=i°"alizin8 the knowledge of indivi- 

theory (i.e.. set o^ruJes) oroX^ this knowledge with the initial competence 

& Sc^duri,: 973K I^effect eL2 "'""'^^^l instructio^(Durnin 

each competence i^le E:s^^:f L^te^^^f ^^ those poVti^ns of 

4v order rulea are unknown to hieh schoor«,M^H* J J extent^ that theiident^f ied higfi- 
facilitate problem sowJ^^ ^erJorL e The lil'ost^ t he, | rules ought lo 

these higher order rules has beenTmn.;«^!^ diagnostic and tnstructio^l efficacy of 
Dumm. I £hrenpre;ir^ 1974)? demonstrated, in a recent field test JScandura, Sfelfeck. 

iearner'Suh^^VoblrSJjLlL^r "^IToKT ^^^'^ ^-^-ion ^ithe ' 

assigning values to various oCj' ive anrcoftrto^n^^^^ 

rtales). it should be possible to study t^e nroMo^ " kindfof instruction (or 

mization In a way which is bo tj precise L ^e^4^lf^ 
this as a critically , impottan^ problerf^r f^tirrL^^r^!"'"^ 
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APPENDICES 



Appendix A. —The only really adequate way of .determining whether a rule is conspatible^ 
with human behavior is to effect a behavioral test; that is, to see whether a rule 
provides an adequate basis for assessing the behavior potential^of individual subjects, 
thereby making it possible to predict the behavior of individual subjects on new 
instances (of the rule). fThe theoretical foundations for sucli tests have been worked 
out and tested empirically [Scandura, 1971, 1973a,- Scandura & Durnin, 1973; Durnin L 
Scandura, 1973].) The basic idea is to diat ermine each subject's behavior potential 
with respect to each rule in an identified rule set, and then to use the theory a 
basis for making ^predictions' concerning performance? on problems which require Inter- 
actions among the| rules. The closeness of fit between the predictions and observed 
behavior would provide a direct test of the adequacy of the rule set. A study reported 
In Scandura (1973a) on rule generalization was of this type. Since this was impracti- 
cal in the present study, we adopted the weaker ^nd less rigorous criterion of 'requir- 
ing that the rule sets be compatible with our intuition (cf. Chomsky, 1957). 

Appendix B. —Strictly speaking, human subjects, are presented with statements of prob- 
lems^as stimuli. Throughout this and our subsequent analyses we assume that the sub- 
ject's initial subgoal is to interpret the goal statement (i.e.-, determine its meaning) 
The second subgoal is to solve the problem. In effect, the initial goal is divided 
into a pair of subgoals to be achieved in order. Our analysis is limited to the 
second part of this task, and then only on the assumption thnt there is no further 
divisi-on of the problem into subgoals. We also assume that the given problem state- 
ments can be uniformly and correctly interpreted. 

.Although we do not pursue the question here, we have reason to believe that 
forming subgoals is closely related to the question of (problem) representation 
(cf. Amarel, 1968). ^ - a 

Appendix C. —Other representations would probubly be more efficient for computer imple- 
mentation, since graphic systems are relatively complex to implement. For example, 
some sort of naming system for points, lines, etc. could be devised together with 
appropriate interpretive routines to identify relations of interest among elements - 
In fact, the naming system for triangles in common use^ evolved for just this purpose; 
names. for sides, vertices, medians, etc., if correctly interpreted, carry much infor- 
mation about relative position, intersections, etc. 

\ ^ 

Appendiit D. — In the structural learning theory (Scand ura, 1973a), 'it is assumed i^d^at 
the'problem solver aUtoroatic'ally tests the solution rule Rg to see if it satisfies a 
higher^ level goa^ condition. That is, is So € Dom Rg and Ran Rg c g? If the higher 
level goal is satisfied, control is assumed to revert to the original goal so that Rs 
will be applied. 

Appendix E. — In evaluating alternative rule-based accounts for a given class of tasks, 
decisions must always be made concerning exactly how the computational load should be 
apportioned to the higher and lower order rules.- Any number of alternatives exist; at 
one extreme, the lower order rules may do all of the computation, in which case a 
separate rule would be needed ^for each type of problem, and, at the other extreme, the 
compon^t lower order rules mm be of minimal complexity with the higher order rule 
assuming most of the computational burden. The requirement of compatibility vith 
human knowledge, of course, substantially reduces the number of plausible characteri- 
zations* 
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AppeadlK F.— We do not atteopt to spell out the procedures nec^sary foiP^^flfidlng 
auxiliary figures. However, in all of the sampled auxiliary figures problems, it was.;, 
necessary to construct. a line parallel to some "distinguished" lln§ through some 
"distinguished" point not on that line. Such procedures also frequently require 
special knowledge — for example, that medians intersect at a common point that is / 
2/3 of the distance ixom tXxd respective vertices to the midpoint of opposite sides. 
Such^ knowledge is frequently logically deducible, but for our purposes, may be repre- 
sented in terms of simple "associations" for example, between triangles with their 
niedians and the coEHtSk intersection property. 

■ > * 

Appendix G. — Scandura's (1973b) coiments regarding relationships between the structural 
learning mechanism, and the notion of heterarchlcal control in systenis of artifdcial 
intelligence (Minsky & Papert, 1972) may be relevant here. 

"For a tirae artificial intelligence systems were viewe4 as wholes, as frequent 
coraplex programs. As work in the area ''{progressed, the difficulties of building upon 
earlier work became increasingly clear because of the close interrelationships among 
various parts of such systems. To overcome this limitation, heterarchlcal , or modular 
planning has been used (e.g., Minsky & Papert, 1972). Heterarchlcal systems consist 
of sets of programs (modules) pertaining to syntax, semantics, line detection, and so 
on, together with a heterarchlcal executive which switches control among these "modules 
In accordance with a predetermined plan. 

"Modules in heterarchlcal systems correspond j essentially to rules in the struc- 
tural learning theory; the executiye control structure corresponds to the basic mech- 
anism. There is, however, an Ms^&e^axit difference between the two. In heterarchlcal 
systems, the basic goal is pragmatfic.j Such systems make it easier to modify and build 
upon previous work. No one seriou^y means to imply that heterarchlcal control 
reflects the way people perform, although in developing artificial intelligence 
systems intuitive Judgements are sometimes made with this in mind. 

"In contrast, the structural learning mechanism is assumed to be built into 
people (presumably from birth); it is not learned and need not be taught . While the ^ 
' rules a person knows may increase from time to time, the mechanism is assumed to 
remain constant. \ 

•j^is is a strong claim, something which no responsible person* would make con- 
cerning executive systems currently used in heterarchlcal systems. Among other things, 
it is very unlikely that an existing control system would be useful in systems other 
than the one for which it was designed. It is my contention that benefits might accrue 
in artificial intelligence and, of course, in simulation if structural learning like 
control structures were used [ppf 42-43]." 

Appendix H. — Such rult^ sets have been called InnatQ bases (Scandura, 1^73a, Ch. 5). 
In general, innate bases lack the immediate, direct computing power of comparable 
rule sets'composed of more complex rules but, theoretically at least,' can grow to 
become more powerful. 

Appendix I. — We realize, of course, that some computer Bpeciallsts may no^ take our 
suggestion very seriously. We, however, find the work in simulation and AI highly 
suggestive for our own studies and hope in the interest of interdisciplinary communi- 
cation that some readers may be moved more in this direction. 

1 ■ 
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